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ABSTRACT 

The SCUBA HAlf Degree Extragalactic Survey (SHADES) is a major new blank-field 
extragalactic sub-mm survey currently underway at the James Clerk Maxwell tele- 
scope. Ultimately, SHADES aims to cover half a square degree at 450 and 850 ^m to 
a 4-cr depth of ~ 8mJy at 850 /^m. Two fields are being observed, the Subaru/XMM- 
Newton Deep Field (SXDF) {Q2Hdr-Qb°) and the Lockman Hole East (10''52™+57°). 
The survey has three main aims: i) to investigate the population of high-redshift sub- 
mm galaxies and the cosmic history of massive dust enshrouded star-formation activ- 
ity, ii) to investigate the clustering properties of sub-mm-selected galaxies in order to 
determine whether these objects could be progenitors of present-day massive ellipti- 
cals, and iii) to investigate the fraction of sub-mm-selected sources that harbour ac- 
tive galactic nuclei. To achieve these aims requires that the sub-mm data be combined 
with co-spatial information spanning the radio-to-X-ray frequency range. Accordingly 
SHADES has been designed to benefit from ultra-deep radio imaging obtained with 
the VLA, deep mid-infrared observations from the Spitzer Space Telescope, sub-mm 
mapping by the Balloon-borne Large Area Sub- millimetre Telescope (BLAST), deep 
near-infrared imaging with the UK Infrared Telescope, deep optical imaging with the 
Subaru telescope, and deep X-ray observations with the XMM-Newton observatory. 
It is expected that the resulting extensive multi-wavelength dataset will provide com- 
plete photometric redshift information accurate to (Sz 5; 0.5, as well as detailed spectral 
energy distributions for the vast majority of the sub-mm-selected sources. In this pa- 
per, the first of a series on SHADES, we present an overview of the motivation for 
the survey, describe the SHADES survey strategy, provide a detailed description of 
the primary data analysis pipeline, and demonstrate the superiority of our adopted 
matched-filter source extraction technique over, for example, Emerson-II style meth- 
ods. We also report on the progress of the survey. As of February 2004. 720 arcmin^ 
had been mapped with SCUBA (about 40% of the anticipated final total area) to a 
median la depth of 2.2 mJy per beam at 850 ^m (25mJy per beam at 450 /J,m), and 
the source extraction routines give a source density of 650 ± 50 sources deg~^ > 3 cr at 
850 ^m. Although uncorrected for Eddington bias, this source density is more than suf- 
ficient for providing enough sources to answer the science goals of SHADES once half a 
square degree is observed. A refined re-analysis of the original 8-mJy survey Lockman 
hole data was carried out in order to evaluate the new data reduction pipeline. Of the 
17 most secure sources in the original sample, 12 have been re-confirmed, including 
ten of the eleven for which radio identifications were previously secured. 

Key words: cosmology: observations - galaxies: evolution - galaxies: formation - 
galaxies: star-burst - infrared: galaxies ~ submillimetre 



1 INTRODUCTION 

Theories of galaxy formation and evolution, embedded 
within hierarchical structure formation model s, can de- 
scribe many of the observed features of galaxie s dCole et alJ 
I2OOOI: iGranato et aljlioool: iHatton et al.ll2003l) . While local 
galaxies can in some cas es still provide constraints on the 
high reds hift populations llPanter. Heavens fc Jimenej20o3 : 
iHeavens'ct al. 200^ . the bulk of the constraints on mod- 
els of galaxy evolution come either from the i ntegral con- 
straint from the far-infr ared background (e.g. iDwek et alJ 
ll998l : lGispert et al.l200(j| . and references therein) , or directl y 
from high-redshift galaxy surveys (e.g. ISteidel et al]|l999l) . 
the most ground- breaking of which were the Canada- France 
Redshift Survey (iLillv et alJll995l) a nd Hubble Deep Field 
North (HDF-Ni lWilliams et alJll996l) . 

Such optical surveys have led to a great deal of progress 
in understanding the assembly of stellar populations, and hi- 
erarchical galaxy formation models are in increasingly good 
agree ment with many (but not all) of these observations 
(e.g. ICole et al.l I2OO0I : ISomerville. Primack fc FabeJ I2OOII: 



[van Kampen, .Jimenez & Peacocl^ I1999I: iKauffmann et all 
^99; Guidordpni ct al 1998; Bl ain et al.lll99gl) . 

However, the discove ry of a substant i al population 
of fa i nt sub-mm galaxies iSmail et alJll99'7l iHughes et alJ 
Il998l : iBarger et alj Il998l) has posed serious problems for 
the current generation of galaxy-formation models based 
on hierarchical structure growth. Models of the optical/UV 
spectral energy distri butions (SEDs) of the galaxy popu- 
lation in the HDF-N jThompson et all 1200 il) predict only 
sub-mJy//^Jy-level 850 /.im flux densities but the sub-mm 
point sources in the HDF-N field have 850 iim flux densities 
of several mJv (iHughes et al.l 119981: ISerieant et all l2003al : 
iBorvs et alj 120031) . This shows that there is a population 
of star forming galaxies that are heavily obscured by dust 
and have much higher infrared luminosities than would be 
inferred from the optical/UV observations alone. If these 
galaxies are at high redshifts (as current data imply), and if 
their emission is powered by star formation with a standard 
solar neighbourhood initial mass function (IMF), then their 
observed 850 ^m flux-densities of several mJy imply star 
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formation rates ~ lOOOMoyr"^. Moreover, the far-infrared 
(FIR) luminosity density implied by the sub-mm galaxy 
population suggests that these infrared-luminous galaxies 
contributed several tens of percent of th e volume- averaged 
star formation de nsity at 2: ~ 2 (e.g. ISmail et alJ Il997t 
iBareer et aPligggTl . 

One attractive interpretation of the sub-mm galaxy 
population is that these violently star-formin g galaxies are 
progenitors of present-d ay massive ellipticals llHuehes et alj 
Il998l: IScott et al.ll2002l) . There are four main pieces of evi- 
dence in support of this dPunlop et al.ll200l e.g.). Firstly, 
the star formation rates inferred from the sub-mm flux den- 
sities are sufBcient to construct the stellar population of even 
the most massive elliptical galaxy in ~ 1 Gyr; secondly, the 
K-hand morphologies of sub-mm-selected galaxies resemble 
those of radio galaxies which locally are hosted in giant ellip- 
ticals llLutz ct al.. 2001,. e.g.); thirdly, the comoving number 
density of bright sub-mm sources in the redshift interval 
z ~ 2-3 is comparable with the p resent-day numbe r den- 
sity of bright > 2-3L* elhpticals llScott et alj|2002ll : and 
fourthly, tentative detections of clustering suggest that the 
sub-mm galaxies trace the collapse of rare, high density over- 
densities at high redshi ft iAlmaini et al.ll2003l : iGreve et alJ 
I2OO4I: iBlain et ahlliooi) . Furthermore, the high dynamical 
masses suggested by CO observations imply massive sys- 
tems, and the gas masses implie d by the CO lumin osities 
suggest extensive star formation llGenzel et alj|20o3) : also, 
the dynamical, gas and stellar masses estimated in the rest- 
frame optical/UV for these galaxies indicate that they are 
both massiv e, gas-rich and already contain significan t stellar 
population JSwinbank et alj|2004l: ISmail et alj|2004) . 

A complication to this interpretation is that theory sug- 
gests a less direct relation between local galaxies and their 
high-redshift antecedents. CDM simulations inevitably pre- 
dict that massive galaxies today are assembled hierarchi- 
cally from a large number of smaller fragments that existed 
at high redshift. Conversely, the majority of the rare early- 
collapsing objects at z ~ 2-3 should be foun d inside massive 
galaxies at the present iBaueh et al.|[l993) . If this is true, 
then it suggests a more detailed set of questions that new, 
larger sub-mm surveys should attempt to settle; (1) what 
fraction of present-day massive ellipticals have merged with 
at least one SCUBA galaxy?; (2) what fraction of SCUBA 
galaxies will end up in a present-day massive elliptical? (3) if 
the answer to the second question is close to 100%, how close 
are the SCUBA g alaxies to the end of the merger process? 
There are claims llBell et alj|2004f) that the total mass of 
stars in ellipticals has roughly doubled since redshift 2 = 1. 
If this increase applies to the most luminous ellipticals, this 
would mean that most present ellipticals were only in the 
earliest phases of assembly at z — 2-3. There is thus some 
uncertainty over whether a given SCUBA galaxy represents 
a late phase in the construction of an elliptical, or an early 
phase in the assembly of its nucleus. Uncertainties in the true 
star-formation rate and the possible lifetime of the starburst 
contribute to this uncertainty, which can only be addressed 
statistically by looking at the population as a whole. 

Detailed models of the hierarchical assembly of galaxies, 
based on standard prescriptions for gas cooling, star forma- 
tion and feedback, in general do not predict large numbers of 
SCUBA galaxies with star-fo rmation rates ~ lOOOMeyr"^. 
However. iBaugh et al.l (l2005ll have shown that the observed 



number counts and redshift distributions of the sub-mm 
sources can be reproduced in CDM models if the star for- 
mati on in these ob jects is occurring with a top-heavy IMF 
(e.g. lLarsonlll993) . implying that the same sub-mm flux 
can be produced with star formation rates ~ 5 times lower 
than for a Salpeter IMF. An alternative model that can ex- 
plain the sub-mm galax i es wit hin the framework of CDM 
is that of iGranato et all i2004f) . who propose modifications 
to the treatments of gas cooling and feedback, as opposed 
to modifications to the IMF. Finally, the treatment of viri- 
alization and the survival of subhaloe s in the model of 
Ivan Kampen. Jimenez fc Peacock il999l) produces star for- 
mation histories which allow for much higher star forma- 
tion rates at early times (especially in bursts), and there- 
fore predi cts sufficient numbe rs of sub-mm galaxies at high 
redshifts ivan KampenI l2003l) . SHADES wiU provide the 
means to distinguish bet ween these and other alternatives 
ivan Kampen et alJl2005^ . 

Of course, the radical possibility that the hierarchi- 
cal orthodoxy may be flawed in some way is worth keep- 
ing open, in order that the standard theory can be prop- 
erly verified. Furthermore, even the interpretation of all 
SCUBA sources as dusty starbursts could still be subject 
to future revision. Although the majority of the bright sub- 
mm sources have secure extragalactic identifications, and 
in many cases redshifts, it remains possible (albeit increas- 
ingly unlikely) that a fraction of sub-mm objects could be 
Galactic in origin ijSciama 2000; Lawrence. 20.01ii) ■ Even ac- 
cepting that the sources a re extragalactic, the starburst 
model is n ot unchallenged; lEfstathiou fc Rowan- RobinsonI 
i2003h and lKaviani et all (l2003ll have successfuUv modelled 
the emission of some sub-mm sources as extended cirrus 
in galaxies heated by the interstellar radiation field. This 
is possible partly because the far-infrared colour tempera- 
tures are not well-constrained with existing data. This in- 
terpretation issm by observational evidence from 
iFarrah et allfeooi) . using sub-mm detections of a galaxy at 
z ~ 0.5 that hosts a Type la supernova. The rest-frame opti- 
cal luminosities and colours of sub-mm galaxies require such 
cirrus galaxies to be more heavily extinguished than their 
local counterparts - Efstathiou & Rowan- Robinson finding 
a best fit extinction co-efficient of Av — 1-3 for the high 
redshift galaxies as opposed to Av — 0.4-0.9 for the lo- 
cal galaxies in their sample. Such an interpretation may be 
testable using high-resolution observations with the Plateau 
de Bure Interfermometer, and will be conclusively answered 
using the Atacama Large Millimeter Array (ALMA) with a 
spatial resolution of < 0.1". 

At present, the determination of the redshift distribu- 
tion and clustering of the sub-mm galaxy population of- 
fers the best available method to constrain the proper- 
ties of the over-densities hosting bright sub-mm galaxies, 
and hence to diff erentiate between alterna tive models of 
galaxy evolution ivan Kampen et al.l |2005^ . To this end, 
the SCUBA HAff Degree Extragalactic Survey (SHADES, 
http://www.roe.ac.uk/ifa/shades/) consortium is map- 
ping 0.5 deg"^ with the Sub-millimetre Common User 
Bolometer Array (SCUBA) at the James Clerk Maxwell 
Telescope (JCMT) on Mauna Kea, Hawaii. SHADES ex- 
pects to produce a complete sample of ~ 300 bright sub- 
mm sources at 850 /im. The survey is being carried out in 
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fields with abundant supporting multi-wavelengtii data (see 
Section f3.1l for full details). 

In this paper, the first in a series of papers on SHADES, 
we present an overview of the motivation for the survey, dis- 
cuss the adopted observing strategy, and describe the oper- 
ation of, and first results from the primary SHADES data 
reduction pipeline. However, we stress that three additional 
and independent reductions of the SHADES data are cur- 
rently underway within the consortium, and that we aim to 
take advantage of these multiple reductions to maximise the 
robustness of the final 850 /xm source list. These alternative 
reductions, and the outcome of cross-referencing the result- 
ing maps and source lists will be presented elsewhere. The 
current paper is structured as follows. Section|5|describes the 
aims of the SHADES survey in more detail, and Sections |3 
and|l]present the data acquisition and analysis methods be- 
ing used. Section 1^ outlines the source-extraction methods 
under development for this large survey and in Section|S|we 
present the progress of the survey so far and Section Q con- 
cludes the paper. Throughout, we assume a cosmology with 
Q,M = 0.3, Q,A = 0.7 and a Hubble constant of Ho = 72 km 
S-' Mpc"\ 

2 MOTIVATION FOR THE SURVEY 

2.1 Background: The problem of 
cross-identifications 



The faint optical/ near-infrared ide n tifications of sub- 



mm galaxies fe. 
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Borvs et al.ll2004^ and the broad f~ 10- 



15") beams of the largest current sub-mm/mm-wave tele- 
scopes, together present difficult ies for the unambi guous 
identification of sub-mm galaxies ijHuehes et al.lll99Sf l. 



However, extensive lon g-term efforts towa rds identifica- 
tion made in the radio fe.g. llvison et alJl2004l), supported in 
some cases by mm-wave interferometry (e.g. iDownes et alJ 
ll999l:lGear et al.ll200nl : iLutz et aljl2001l) . have produced ra- 
dio identifications for ~ 50-70% of the brighter sub-mm 
sources. These radio detections have been successfully ex- 
ploited to derive accurate (i.e. sub-arcsec) positions for sub- 
mm galaxies, thus facilitating further spectroscopic study. 
As a result, spectroscopic redshifts for approximately 90 
radio-detecte d sub-mm s ources have been published to date 
ilvison et al ]|i998, 200*01: IChaoman et alj |2003. 20oJ). 

The spectroscopic follow up is of course biased against 
those sources at redshifts where no spectral features fall 
within the spectroscopic range, most notably 1.2 < z < 1.7, 
and may also exclude the highest redshift objects (due to the 
less favorable K-correction in the radio waveband compared 
to the sub-mm for redshifts 2 > 3). Together these effects re- 
sult in moderate incom pleteness in the final redshift surveys 
ijChaoman et aljl2005^ even if all identifications are robust. 

Recently, rapid detections of SCUBA galaxies have been 
made with the Syitz er Space Telescope , in integrations of 
only ~ 10 minutes jEgami et all 12004 llvison et al J 120041: 
ISerieant et al.l l2004l:lFraver et alJl2004f l( ^harmandaris et alJ 
l2004fl . This small sample of identifications, in conjunction 
with the abundant Spitzer coverage of our fields, shows 



the potential for identification and follow-up of the sub-mm 
sources. 

Despite the radio and the Spitzer data, some sources 
may remain unidentified. Lack of a robust radio or mid- 
infrared indentification could have five origins: (i) the sub- 
mm source could b e spurious; (ii) the source could be 
severely flux boosted jEddingtonll9ll see SectionEU; (in) 
the radio/far-IR emission could be significantly more ex- 
tended than the VLA synthesised beam; (iv) the character- 
istic dust temperatures could be low; or (v) the source could 
be at very high redshift. 

Nevertheless, armed with lower resolution observa- 
tions with the VLA, along with the low-resolution sub- 
mm imaging to be provided by the B alloon-borne Large 
Area Sub-milhmetre Telescope fBLAST. lHughes et al.l2002l. 
see Section [3.11 and mid-infrared imaging from the Spitzer 
Wide- Area Infrared Extrag alactic Legacy Survey (SWIRE, 
iLonsdale et alJ l200l I2OO4I see Section 13.111 we anticipate 
being able to distinguish between these five alternatives for 
most of the apparently unidentified sources. 

The large positional uncertainty of the SCUBA sources 
may also lead to unreliable identifications. A measure of this 
is the cross-identification limit, which we define to be one 
random source per 10 search areas. At this surface den- 
sity the likelihood of a spurious identification is given by 
P<1- exp-^'->^'>'""^ ~ 0.1, where r is 7" (the half width 
half maximum of the JCMT beam at 850 /im), and A''(> S) 
is the cumulative source counts of other objects in the identi- 
fication catalogue with fluxes greater than S. For the Spitzer 
IRAC 3.6, 4.5 /im bands, the resulting cross-identification 
limits are 58 and 60 /iJy respectively, well above the SWIRE 
sensitivity and confusion limits (shown in Table 0. For the 
IRAC 5.8 and 8.0 nm and Spitzer MIPS 24^m bands, the 
cross-identification limits are 48, 44 /xjy and 120 fj,Jy, simi- 
lar to the S WIRE sensitivities but still well above the con- 
fusion limit jFazio et alJl2004l; iMarleau et alJl2004^ . The 70 
and 160 /xm cross-identification limits of 0.93 and 9.1 mjy 
are well below the c onfusion and SWIRE sensitivity lev- 
els (iDole et alj|2004ft . This highlights the need for careful 
analyses to identify the SHADES galaxies at wavelengths 
shorter than 24 /xm. Our abundant multiwavelength cover- 
age, especially optical and near-infrared, can be used to find 
objects by looking at outliers in colour- space iSmail et alJ 
120021: IWebb et alJl2003l: IPope et aljEoOsl. e.g.). 

To illustrate the potential power of the multi- 
frequency /multi-facility approach adopted for SHADES, we 
have investigated the properties of four template galaxies, 
whose SEDs have been normalised to the SHADES survey 
depth of 8mJy at 850 /xm, as the assumed redshift is var- 
ied. Table0shows the flux densities expected from our four 
template galaxies. Note the similarity of the SWIRE limits 
to the Arp 220 model SED flux densities and also to the 
depth required for reliable identifications. Fig.Q shows the 
four example model SEDs as the assumed redshift is varied 
from z — to z = 4. The key points illustrated by these 
models can be summarized as follows. 

i) The Arp 220 SED with the SHADES normalisation shows 
that BLAST detections of Arp 220-like SEDs should be pos- 
sible to 2 ~ 2 — 3 and that detections in 3.6 and 4.5 ^m in 
conjunction with non-detections at 5.8, 8 and 24 ^m can be 
used to differentiate Arp 220-li ke SEDs from other forms. 
The relative number counts of iHuang et alJ (|20o3) show 
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that h igher wavelength drop-outs do occur: lEe Floc'h et alJ 
i2004f) . using preliminary Spitzer MIPS (24 ^m) results, sug- 
gest that the SEDs of high redshift (z > 1) sources in the 
Lockman Hole East and the Extended Groth Strip are well 
fitted by an Arp 220-like model. 

ii) M82 is another prototypical star-forming galaxy and 
galaxies with M82-like SEDs should be visible at BLAST 
depths out to z ~ 3 and Spitzer SWIRE depths out to z ~ 4 
using the IRAC (3.6-8 fim) wavebands. 

iii) The SED of HRIO, a high-redshift, extremely-red galaxy, 
is well matched out to redshifts of 2 ~ 2 — 3 with BLAST 
and z ~ 3—4 at shorter wavelengths. HRIO would be defined 
as a ultraluminous infrared galaxy (ULIRG) by its infrared 
luminosity. 

iv) The SED of NGC 1068, dominated by an AGN-heated 
dusty torus, is discussed in more depth in Section f2. 2. 31 

These models illustrate that the mid-infrared and far- 
infrared flux-density limits for the coverage of the SHADES 
fields will detect sub-mm galaxies to redshifts z J: 4, compa- 
rable to that from the deepest radio integrations. We can 
confirm these expectations using the properties of exist- 
ing SCUBA galaxi e s which have been detected at 24 /xm 
jEgami et alJ l2004t iFraver et aDl2004^ . These have been 
found to have 24 fim flux densities ranging from 80 to 
2800 /iJy, although the median flux is 230 /iJy and half of 
the detections lie in the range 160 — 370 /iJy. Such measure- 
ments compare favourably with the anticipated SWIRE 5 a 
survey limit of 105 fiJy at 24 fim (Lonsdale et al. 2004). The 
deeper Spitzer GTO data may also detect many of the sub- 
mm galaxies at other wavelengths. 



2.2 SHADES science goals 

2.2.1 Measurement of the cosmic history of massive 
dust- enshrouded star-formation activity 

A key constraint in the phenomenology of galaxy forma- 
tion is the cosmic history of dust-enshrouded star forma- 
tion, observed via the evolution of far-infrared luminos- 
ity density. The flat sub-mm selection function is well- 
known to give sub-mm galaxy surveys an informative con- 
straint on the far-infrared lumin osity density throughout 
most of the Hubble v olume (e.g. iFranceschini et aPllQQll 
iBlain &i LongaiJ Il996l) . Spectroscopic redshifts or photo- 
metric redshift estimates are clearly essential to constrain 
the evolving far-infrared luminosity density, but the iden- 
tifications have proved challenging (see above). Moreover, 
even when secure identifications are available, the optical or 
near-infrared spectroscopic follow-up observations are time- 
consuming, typically requiring > 2 hours of integration on 
each target with 8-lOm-class telescopes, and arc not guar- 
anteed of success (e.g. ,CliaDnian ct al. 2003; Simpson ct al 
|2003). 

However, even in the absence of an optical or near- 
infrared identification photometric redshifts can still be 
derived on the basis of long-wavelength (i.e. radio-to- 
far-infrared) photometry. The simplest of these meth- 
ods uses a single colour ratio between observations at 
850 /^m and 1.4 GHz JCariUi fc Yunll99gl : lDunne et aj2000l: 
iRengaraian fc Takeuchill200 j) to discriminate between low 
and high-2 star-forming galaxies. This method applied to a 



sample of 30 sources dlvison et alj|2002l) produces estimated 
redshifts accurate to Sz ~ 0.4-1.5. 

Despite the dust temperature - reds hi ft de- 
generacy present in F IR sources iBlainl Il999bt 

iBlain. Barnard fc ChapmanI 120031) . extension of this 
method to include multiple colour information in the radio 
to sub-mm bands allows the derivation of ph otometric red- 
shifts with accuracies of (5z ~ 0.5 or better jYun fc CariUi 
[ 20021: iHuehes et alj 120021: lAretxaga et al.l boOS . .20051 
Iwiklindll200ir even taking into account the full range of 
dust properties present in the local galaxy analogues. The 
most recent comparison of photometric and spectroscopic 
redshifts for blank field sub-mm sources yield a dispersion 
of Sz ~ 0.3 w hen three or more long -wavelength detections 
are available llAretxaga et alj 1200 Sl) . Uncertainties due to 
the temperature-redshift degeneracies could be further 
reduced by performing spectrosco pic calibration up on a 
representative subset of the sample (iBlain et al.ll2004l . 

For those sources for which near-infrared or mid- 
infrared identifications are secured, a complementary (and 
in principle independent) method of estimating redshifts 
is to use the Spitzer Space Telescope and correspond- 
ing ground based near-infrared observations to identify 
the position of the redshifted 1.6 /im peak of the near- 
infrared stellar continuum u s ing the IRAC 3-8 fim bands 
JSimpson fc Eisenharddll999l : ISawickilliool . This method 
gives an indication of which objects lie at redshift z > 1.5 
and obtains redshift estimates accurate to Sz ~ 0.5 at 
z < 1.5 independent of the BLAST and VLA measurements. 
Where optical photometry is available, the p hotometric red- 
shift is more accurate still dPope et al.ll2005ll . 

Given the need to break the sample into a few redshift 
bins, while maintaining statistically useful numbers in each 
bin, we require of the order of 100 sources in total in order 
for the A'^(z) hist ograms to be able to diffe rentiate between 
available models (Ivan Kampen et"ai]|2005l) . The combined 
multiwavelength data are expected to yield a redshift reso- 
lution of better than Sz ~ 0.5 for 300 galaxies, using spec- 
troscopically calibrated photometric redshifts (Aretxaga et 
al. 2005). 



2.2.2 Determination of whether SCUBA galaxies are 

progenitors of present-day massive elliptical galaxies 

The high inferred star formation rates of sub-mm galax- 
ies, while consistent with expectations for proto-ellipticals, 
nevertheless do not provide unambiguous evidence that they 
are progenitors of massive elliptical galaxies. This is both be- 
cause the duration of the starburst is unknown, and because 
the derived star-formation rate is sensitive to the assumed 
temperature and initial mass function of the mass distribu- 
tion (Larson 1998; Baugh ct al. 2 0051) . In contrast, the clus- 
tering of bright SCUBA sources on scales of up to ~ 10 Mpc 
offers a potentially very p owerful constraint on the nature 
of the sub-mm population llPercival et al.ll2003l) . given even 
a relatively broad {Sz ~ 0.5) constraint on the estimated 
redshifts of individual sources (van Kampen et al. 2005). 
As discussed in van Kampen et al. (2005), to sample the 
sub-mm population over such scales requires a degree-scale 
survey (1° is 29 Mpc at z = 2 in our adopted cosmology). 
Therefore, the desire to probe scales approaching 10 Mpc 
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provides one of the primary motivations for the eventual 
areal coverage goal of SHADES (i.e. half a square degree). 

Previous blank sky surveys carried out with SCUBA 
have identified a total of ~ 100 sources in disparate areas on 
the sky. These survey fields include the Hubble Deep Field 
jHughes et alll998l:ISerieant et al.l2003al : lBorvs et aj2003l: 
IWang et alj|2004l: iBorvs et alJl2q04f) . the Hawaii deep field 
regions ' (Eetfge^taLl^O^ i' ^Q'gll . the Lockma n Hole East 
and the Elais N2 re gion in the 8-mJy Survey dScott et alJ 
l2002l:lFox et al .l2002ll. the Spitzer northern continuous view- 
ing zone dSawicki fc Webbll2005^ ■ and the Canada-France 
Redshift Survey (CFRS) fields b y the Canada UK Deep 
Sub-millimeter Survey (CUDSS; lEales et al.lll999ll . How- 
ever, the resulting composite existing 'sample' of sub-mm 
sources spans a wide range in intrinsic luminosity and is 
distributed between many small fields imaged to varying 
depths. As a result it has proved impossible to derive unam- 
biguous constrain ts from the apparent clustering stre ngth 
of these sources jBorvs et al.l 120031: IScotteraOliQoi). By 
providing a complete and homogeneous sample of the most 
luminous sub-mm sources in two wide-area fields, SHADES 
aims to provide the first robust constraints of the clustering 
properties of the sub-mm galaxy population. 



2.2.3 Determination of the fraction of SCUBA sources 
that harbour obscured active galactic nuclei 

The Spitzer 3. 6-160 Atm data from the SWIRE and 
GTO survevs (iLonsdale et al] l200l l2004l : lEgami et all 
I2OO4I : iHuang etan ~ l200^ sample the spectral energy 
distributions in the rest-frame near-infrared and mid- 
infrared. The latter is sensitive to the presence of AGN 
dust tori, making the Spitzer data key to determining 
the AGN bolometric fractio n in these hig h-infrar ed- 
luminosity galaxies (e.g. [Mmaini^Law^nc^fc Bo^^ 
|l999|: [Efstat hiou. Rowan- Robinsoi^^iBbenmo^reiTl2^ 
l^mai^^dTl2002l) . Most torus models show warm colour 
temperatures which therefore would not contribute signifi- 
cantly to the sub-mm fiux. 

X-ray visible AGN and sub-mm sour ces are only rarely 
coincident in shallow X-ray observation s (|BaArtz_et_ajJ|200(i 
iFabian et aI.l2000l:IWaskett et al.l2003l : lAlmaini et alfcooar) 
and yet seem to trace similar structures on arcminute scales 
in the Elais N2 field (lAlmaini et alJl2003fl . Possibly, these 
two populations represent different, relatively short-lived, 
phases in the formation of massive objects at high redshift. 
Alternatively, the majority of SCUBA sources may contain 
a massive and active black hole that is too heavily ob- 
scured to be detected with the current X-ray surveys. Such 
Compton-thick objects may be associated with the forma- 
tion of super-massive black holes. Some X-ray objects have 
been found to have a sub-m m sou rce associated with them 
(e.g. iBarger et ah 20o'lh and lAlexander ot al, (,.2003. 20o3) 
suggest that even those sub-mm galaxies without an X-ray 
counterpart could contain low luminosity AGN. However, 
as Alexander et al. point out, the sub-mm emission in these 
galaxies would be dominated by star formation and not the 
AGN, so any estimates of star formation rates would not be 
affected. 

The NGC 1068 SED in Fig.[T] normalised to the 
SHADES sensitivity, demonstrates that hyperluminous 
AGN dust tori in the SHADES survey, with the AGN dom- 



inating even the sub-mm flux, should be detectable in the 
Spitzer 24 - 160 /im and BLAST 250 - 500 /xm bands at 
redshifts of up to 2 = 2. In this extreme limiting case, the 
SED is dominated by the dust torus, with little contribu - 
tion from circumnuclear star formation. Ilvison et alj (JioO^) 
and lEgami et al.l (|22q3) have already shown that Spitzer 
24 /im photometry is efficient at demonstrating the presence 
of AGN in cases where the AGN makes a much smaller bolo- 
metric contribution, compared to that of the star formation; 
this AGN detection is particularly eff ective when combi ned 
with photometry in the IRAC bands ilvison et al.ll2004l) . 



3 DATA ACQUISITION 
3.1 The survey fields 

The half square degree to be covered by SHADES is split 
between two survey fields - the Lockman Hole East (field 
centre approximately lO*" 52"" 28° -^57° 22' 20") and the 
Suha.ru/XMM-Newton Deep Field (SXDF, 02'" 18™ 00' 
-05° 00' 00"). The fields were chosen for their low Galactic 
cirrus (100 ^m surface brightness ~ IMJysr"'^) and benefit 
from abundant cospatial multi-wavelength data. The dec- 
lination of the source fields and their spread in RA make 
these fields observable for the majority of the year using the 
JCMT as well as being accessible to BLAST, Spitzer, the 
VLA, UKIRT, Subaru, Keck and Gemini telescopes. The 
decision to include an equatorial field was partly driven by 
the desire to provide a sub-mm source catalogue accessible 
to ALMA. 

Observations in the Lockman Hole East are being ex- 
tended around the 15 1 arcmin^ observ ed as part of the 
SCUBA 8-mJy Survey iScott et al.l2002^ . The direction and 
shape of this extension is driven primarily by the existence 
of ultra-deep VLA imaging at 1.4 GHz. 

The specific choice of the SXDF as the equatorial field 
was motivated by the existence of deep yiMM-Newton imag- 
ing, associated deep VLA observations at 1.4 GHz, and ex- 
isting deep multi-colour optical imaging obtained with the 
Subaru telescope. 

The near-infrared (J, H, K) imaging for both fields will 
be provided by the new UKIRT WFCAM instrument as part 
of the UK Infrared Deep Sky Survey (UKIDSS). Specifically, 
the Ultra Deep Survey (UDS) component of UKIDSS will 
cover 0.77 deg^ to A' ~ 23 in the SXDF, while the Deep Ex- 
tragalactic Survey (DXS) in UKIDSS wiU map the SHADES 
Lockman field to A" ~ 21. 

Further multi-wavelength coverage comes from BLAST 
(500, 350 and 250 /im) surveys in our fields and two Spitzer 
surveys - the SWIRE Legacy Survey (iLonsdale et alj|200l 
|2004[) and the Gua r anteed Time Obser vations (GTO) data 
iEgami et all l2004t iHuang et al]|2004l) . The BLAST sur- 
vey is expected to be confusion limited at all wavelengths, 
corresponding to a 5a flux density at the confusion limit 
of approximately 25 m jy in all three bands (derived from 
iRowan-RobinsonI 1200 ih . The SWIRE survey fiux density 
limits are shown in the first row of Table Q Various pointed 
foUowups are also underway including using SHARC-II at 
the Caltech Sub- millimeter Observatory (CSO). 

The BLAST 500 /im data are expected to be deeper 
than the SCUBA 450 /im data because the SCUBA obser- 
vations are deliberately restricted to be conducted in only 
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Figure 1. Model spectral en ergy distribution plot s of template galaxies with survey sensitivities overplotted. Filled circles: SWIRE 
sensitivities 500 seconds, 5(7 iLonsdale et alj|200'j) . Open triangles: The pre-flight Spitzer sensitivities for 70 and 160 /im are shown 
whilst the SWIRE team rc-assess their observing strategy due to the degraded MIPs sensitivities, although currently t hese are expected 
to be two to three times worse. Filled tria ngles: The pre-flight BLAST confusion limit estimates (derived from iRowan-RobinsonI 
|2001^ of 25 mjy; the sensitivity estimates jHughes et al.l l2002^ are in the range 15 — 25mjy. Filled diamonds: SHADES survey 
sensitivities of 8 mJy at 850 fira and 75 mJy at 450 /im. Models have been normalised to the SHAD ES surv ey depth of 8 mJy at 
850 fj,m. Top left panel: Arp 220 - a highly-o bscured local starburst galaxy (no te: it is also a ULIRG lElbaz et al. 2003)- ^"P right 
panel: NGC106 8 - a typical Seyfert galaxy f Efstatliiou. Hough fc Youndll993) . Bottom left panel: M82 — an irregular dusty star- 
forming galaxy jE^tathiou. Rowan- Ro binson fc Sicbcnmorecn „200n|) . Bottom right panel: HRIO - a high-redshift, extremely-red galaxy 
jTakagi. Hanam'^^^jimoto[l20o3) ~Bolometric f3-100o7mi rest-frame) luminosities for each model, as normalised, are shown in the 
legends. 



grade 2-3 weather (r225GHz — 0.05 — 0.10; this restriction 
was adopted to allow other, smaller programmes on the 
JCMT to best exploit the rare, grade-1, conditions). Because 
the 450 nm SCUBA data will be of only moderate quality, 
we expect few sources to be detected in the 450 nm maps. 
Therefore, in this paper, we focus primarily on the 850 ^m 
data. 



3.2 SCUBA technical information 

SCUBA is composed of two arrays of bolometers that view 
the same region of sky simultaneously, a long wave array 
of 37 bolometers used at 850 fim and a short wave array 
of 91 detectors used at 450 fim. The pixels are arranged in 
a hexagonal pattern, with the feedhorns close-packed. The 
bolometer performance improves with decreasing tempera- 
ture, so SCUBA has a helium cooling system to improve 
detector sensitivity. Thermal noise from the sky and local 
surroundings dominates at millimetre wavelengths. 



The SHADES survey is conducted in jiggle-mapping 
mode. The SCUBA bolometers instantaneously undersam- 
ple the sky but are dithered in a 64-point pattern to ensure 
that, overall, the sky is sampled at the Nyquist frequency or 
better at both wavelengths. Further details on SCUBA can 
be found in iHoUand ct aL (,1999,). 

The terrestrial atmosphere and thermal emission from 
the telescope both contribute to a strong background (~1 Jy 
per square arcsec) and the atmospheric part of this emission 
varies rapidly. By rapidly chopping the secondary mirror and 
nodding the entire telescope these effects are reduced, leav- 
ing a residual atmospheric noise which is common-mode to 
and therefore uniform across the whole array of bolometers. 

Jiggle-maps are coadded to improve the signal to noise 
ratio (hereafter S/N) allowing measurements of signals that 
are tens to hundreds of thousands of times fainter than the 
background. 
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Table 1. Comparison of the Spitzer survey flux density limits with galaxy populations normalised to the SHADES 
depth of 8 mJy at 850 /im. Also listed are the effective cross-identiflcation limits (as described in the text). 



Wavelength 


3.6 /i.m 


4.5 


5.8 


8.0 Mm 


24 Mm 


70 Mm 


160 Mm 


SWIRE 5cr in 500s t 


3.7/iJy 


5.3 fj.]y 


48 MJy 


37.7 MJy 


106 MJy 


6 mjyt 


SOmJyt 


Sconf 


1.5 /iJy 1 


1.5AtJy ^ 


8.3AtJy^ 


5.4 MJy^ 


88 MJy 'I 


5.3mJy§ 


48 mjy§ 




1.7^ 


I.4I 


0.84 H 


0.81^ 


O.I3II 


0.009§ 


0.003§ 


ARP 220 ^ = 2 * 


22/iJy 


23 AtJy 


26 AiJy 


17MJy 


0.21 mJy 


3.7mJy 


62 mJy 


ARP 220 z = 4 * 


3.5/iJy 


5.2AtJy 


6.lMJy 


6.9 MJy 


6.0 MJy 


0.17mJy 


4.3 mJy 


NGC 1068 z = 2* 


9.6 /iJy 


22 ^Jy 


55 fj.Jy 


0.16mJy 


3.3 mJy 


16 mJy 


32 mJy 


NGC 1068 z = 3 * 


2.6 /iJy 


S.g^tJy 


10 MJy 


32 MJy 


0.86 mJy 


5.4 mJy 


12mJy 


M82 2 = 2 * 


74 /.Jy 


0.11 mJy 


0.15 mJy 


0.11 mJy 


1.4 mJy 


9.6 mJy 


58 mJy 


M82 2 = 4* 


4.2 /ijy 


10 MJy 


19 MJy 


36 MJy 


38 MJy 


1 . 1 m Jy 


5.3 mJy 


HRIO z = 2* 


YlfiJy 


0.12 mJy 


0.15 mJy 


0.11 mJy 


0.36 mJy 


6.6 mJy 


44mJy 


HRIO z = 4 * 


2.4 /.Jy 


6.6 fj.3j 


19 MJy 


39 MJy 


20 MJy 


0.28 mJy 


5.0 mJy 



t SWIRE limits from Lonsdale et al. (20041 . 

The SWIRE team is currently re-assessing the observing strategy due to the degraded MIPS sensitivities. 
The values shown here for 70 and 160 Mm sensitivities are pre-flight estimates only. 

Confusion limit from observed source counts, of one source per 40 SCUBA beams (7" radius circle) 
tNumber of observed sources per SCUBA 850 Mm beam (7" radius circle) greater than the SWIRE limit. 
^Number counts from^^gi^^^^^ (2^Q^ 
II Number counts from^arleai^^^^ Q004) 
§ Number counts from^ol^^^a^ (200^ 



3.3 The observing strategy 

The survey makes use of close-packed hexagonal geometry 
to place jiggle-maps in an interleaved positioning scheme. 
This provides as uniform a noise level across the survey field 
as practicable, as well as ensuring that each sky position 
is covered by multiple bolometers. Fig.|21 shows one inter- 
leaved 'tripos' pattern, illustrating the central triangular re- 
gion covered uniformly by the three hatched jiggle-maps. 

The tripos system of three overlapping maps is observed 
for each of six different chop-throw and chop position an- 
gle (PA) combinations. This is a significant departure from 
the previous survey strategy most similar to SHA DES, that 
adop ted for the 8-mJy survey fe ott ct al. 2 003 ; IFox et alJ 
|2002)- We specifically chose the six chop th rows motivated 
by the Emerson-II chop throw methodology (lEmersonll995l) 
i.e. chop throws of 30 ", 44 ", 68 ", at PAs of zero and 90 de- 
grees in right ascension/declination co-ordinates. This choice 
of chop throw and PA ensures that no Fourier modes larger 
than the beam and smaller than the largest chop throw are 
lost entirely. The source extraction from these multiple chop 
throw maps is discussed below (Section |^ . By observing 
each of the chop throw/PA combination maps at different 
airmasses, the noise levels in each tripos should ensure an 
even coverage. Thus, one chop throw / chop PA combination 
will always be observed within a particular airmass range; 
while it would be possible to balance the airmasses evenly 
over the chops in order to avoid correlating the chop strat- 
egy with airmass, our adopted strategy makes observing de- 
cisions easy enough to maintain a low error rate. In addi- 
tion, the jiggle-maps at each tripos position are observed in 
a priority order which keeps the survey area approximately 
circular at any time, in order to minimise the perimeter area 
which will have lower signal-to-noise. 

In the SXDF, the observing strategy has been to extend 
the region from the centre of the field in a spiral manner. The 



area of Lockman Hole East already covered by the 8-mJy 
survey has not been repeated, since the noise level of this 
area is already at the required depth. Therefore, as discussed 
above, the new SCUBA data have been extended around the 
existing 8-mJy Lockman Hole East data. 



4 THE DATA REDUCTION PROCESS 

We discuss here the Interactive Data Language (IDL) reduc- 
tion pipeline developed for SHADES from the original 8-mJy 
survey reduction process iScott et al.ll20o3 : ISerieant et alJ 
^OOM. This process fiat-fields the data, combines the data 
from the individual chops and nods, and corrects for the 
effect of the atmosphere using an extinction correction rou- 
tine. The pipeline also corrects for noisy bolometers and fast 
transient spikes in the data, e.g. cosmic rays (see below). In 
order to extract spatial information, the jiggle-map data can 
be regridded onto rectangular co-ordinates. The improve- 
ments and variations on the 8-mJy survey pipeline made for 
the SHADES data are listed below; for furthe r details of this 
reduction method (see lSerieant et alll2003ah . 

4.1 Extinction corrections 

Sky absorption caused by water vapour in the atmosphere 
was previously removed from the data using smoothed CSO 
sky-dip measurements extrapolate d to 850 and 450 ^lm (u s- 
ing the conversion relations give bv I Archibald et all ||2002|) ) 
or by SCUBA sky-dip measurements when insufficient CSO 
data were available. CSO tan sky monitors take sky-dip 
readings at 225 GHz every 10 minutes. SCUBA sky-dips 
have the benefit of being at the observation wavelength 
but are taken at a different time to the observations. The 
new pipeline pref erentially uses the w ater vapour radiome- 
ter data (WVM, IWiedner et al]|200ll) . available since July 
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Figure 2. The 'tripos' positioning scheme. Individual jiggle- 
maps (2.6' diameter) are represented as large hatched circles; 
only three of these are plotted for clarity, but the centres of the 
others are marked with small open circles. This pattern of over- 
lapping jiggle-maps, in conjunction with observing different chop 
throw/PA combinations at different airmasses, provides an ap- 
proximately uniform coverage over the map. These three hatched 
jiggle-maps, with centres marked by filled circles, comprise one 
'tripos'. Only these three positions contribute to the data within 
the triple-hatched triangle formed by the filled circles. Any other 
triangular region is covered by its own combination of three jiggle- 
maps. The jiggle-map positions shown by the open circles are ob- 
served sequentially in a spiral manner from the centre of the map 
outwards; the resulting roughly circular areal coverage minimises 
the perimeter, which has lower coverage and so lower S/N than 
the rest of the survey. 

2000, that are taken at the same time and azimuth as the ob- 
servation at a frequency of 183 GHz every six seconds. These 
WVM data have been converted into 225 GHz values, allow- 
ing for the elevation corrections, by Weferling (priv. comm.). 
The WVM data are smoothed over 18 seconds - a timescale 
similar to that of the nod - and applied to the data. We 
have undertaken a comparison of the WVM, SCUBA sky- 
dip and CSO sky-dip data, and find that they agree to within 
10% when converted to 225 GHz opacity. This corresponds 
to derived flux-density uncertainties of a few percent. The 
sky extinction correction from the WVM has the advantage 
of being able to reliably fit variations which occur over the 
course of a single map as well as over the course of the night. 
It is therefore used for all nights where the necessary data 
are available. 

4.2 Bolometer and sky noise analysis 

In what follows we refer to a single measurement (i.e. the 
average of 8 chop cycles subtracted between the 2 nod posi- 
tions) from a single bolometer as a 'readout'. Each readout 
represents approximately 2 seconds of integration time. 

The noise in our data varies with time and between 
bolometers. As the noise was found to vary significantly on 
timescales significantly longer that ~ 100 readouts, we de- 
cided to measure the noise in 128-readout groups for each 
bolometer. This is achieved by fitting a Gaussian to the 
histogram of readouts of a 128-readout group. Outliers in 
these readout distributions at the > 3a level are fiagged as 
ghtches. This filters out cosmic rays. Instrument systemat- 



ics should be removed by the nodding technique. We find, 
nevertheless, that there is still residual sky emission in the 
data, caused by the sky level varying on timescales less than 
that of the nod. This level is difficult to characterise, but 
appears to be common to all bolometers in the array, tak- 
ing into consideration the noise level of individual bolome- 
ters. The average in the form of the mode of the data is 
found by fitting a Gaussian to the data from all bolometers 
for each of the long/short wavelength bolometer groups for 
each time step. The DC-offset over the long/short array is 
then removed for each time step. Sky levels are evaluated 
independently at each wavelength so that two flux readings 
can be taken at different wavelengths from the data. Higher 
order terms of the sky-level are not removed in this method 
as they have been found not to be significant. 

A significant difference from the 8-mJy survey is the 
presence of a spike in the power spectrum of some of the 
bolometers at a period of roughly 16 sam ples, for dates from 
the end of 2002 onwards as seen in Fig.l^ teorvs et al ]|2004l : 
IWebb et alJl2004l) . Tests have shown that this effect is only 
significant for our data between December 2002 and June 
2003. This effect presents difficulties for the sky-noise re- 
moval program because the frequency is the same as that of 
the jiggle pattern itself. It has the effect of skewing the dis- 
tribution of the sky-noise, so that it is not correct to remove 
the same DC sky level for both those bolometers with the 
noise spike and those without. Where a significant number 
(greater than 10) of bolometers are affected in each map, 
the bolometers with and without the noise spike are treated 
as individual data sets for the sky-subtraction analysis. The 
process of bolometer noise measurement and sky subtraction 
is then iterated. 

In the method described in this paper we have not at- 
tempted to remove the noise spike, but have simply chosen to 
isolate the affected bolometers in our sky subtraction. There- 
fore we do not knowingly allow the presence of the spike to 
distort the sky subtraction process. As a result our assigned 
noise level s may not be optimal, although sources lost to 
the higher noise regions should be accounted for statistically 
using completeness simulations. An alternative noise spike 
treatment, which seeks to remove the noise spike from the 
data time-stream, has been applied to the SHADES data, 
and the results are almost identical. 

Implicitly these algorithms assume the S/N of point 
sources to be negligible in the bolometer readouts, which 
can readily be seen to be the case given the thousands of 
readouts that contribute to the 3-10 a sources in sub-mm 
surveys. However this is not true for calibrators, so for these 
the noise measurement and deglitching stages are replaced 
by assigning a noise value that consists of an arbitrary- mag- 
nitude noise equivalent fiux density (NEED) value which 
scales as the square root of the total integr ation time on 
each position on the sky llArchibald et alJl2002") . 

4.3 Flux calibration 

Calibration maps of Mars, Uranus, and secondary objects 
(the compact non-variable sources 16293—2422, CRL2688, 
CRL618, HL Tau and the variable sources IRC-l-10216 and 
OH231.8) have been used to calculate the fiux conversion 
factors (FCF) for each night jjenness et aljlioool) . The cal- 
ibration maps are taken with the same chop throws as the 




Figure 3. The spike in the power spectrum of the bolometer readouts. The upper row of plots shows the power spectrum of a typical 
(in this case the central) bolometer, before (left) and after (right) sky subtraction. The lower row of plots shows the same for a bolometer 
affected by the power-spike, showing clearly the spike at a period of roughly 16 samples (at the dotted line) and the fact that the 
power-spike itself has not changed. 



observations, and give information on the gain of the tele- 
scope as it changes throughout the night in response to dish 
temperature and environmental effects. The FCF is partic- 
ularly variable at times around sunrise/sunset. Therefore 
calibration observations at all three chop throws and posi- 
tions are taken at the start and end of each observing block, 
around sunrise/sunset. Extra, single chop throw, observa- 
tions are also made when many hours of observations have 
been taken without calibrating. 

A source extraction routine identical to that used in 
the final source extraction analysis is used to find the 
integrated voltage reading of the calibration source, and 
this is compared to standard fiux values, including adjust- 
ments for the known variability of IRC-l-10216 and OH231.8 
Jjenness et alJl2002l) . to calculate the FCF. This gives an 
FCF representing Jy for the total flux of a point source 
(see Section fS. in . This calibration factor corresponds to the 
typical notation of Jy per beam. Calibration factors can 
have an error of up to 10%, because of variation with time 
and measurement of the sources' integrated flux. The typical 
variation in FCF across any SHADES shift is approximately 
5%, though this is larger when data are taken around sun- 
rise or sunset, or when observing is extended into daylight 
hours. The FCF data are time-interpolated over the night, 
as opposed to using the mean value for each half of the night, 
as was done for the 8-mJy survey. 

Some FCFs do not follow the trend of the data for the 
rest of the night. The most common reason for this is be- 
cause the calibrator is extended. Mars behaved as a non- 
point source object due to its recent proximity to the Earth 
and IRC-l-10216 has a CO/dust envelope of at least 1 arcmin 
in extent. These observations were not used to calculate the 
night's FCFs. Approximately 10% of our calibration maps 
are affected in this way. When abnormal FCFs occur the cal- 
ibration changes throughout the night are tested using the 
pointing observations of the secondary calibrators named 
above. 

Where there are no usable calibration values or no 



agreement between the pointing observations and calibra- 
tion observations, standard calibration values are used. 
Standard calibration values were evaluated using our own 
sample of calibrator and pointing maps, reduced in IDL. 
These standard FCFs are approximately 20% lower than 
the JCMT standards using a SURF reduction, due to the 
differences in the method used to rebin the data. This sys- 
tematic difference demonstrates the importance of treating 
the calibrators in the same way as the sources. One test of 
these FCFs is to compare the flux densities evaluated for 
sources with e.g. the flux densities calculated using the 8- 
mjy survey-reduction method; the flux densities were found 
not to be systematically affected. Tests were also carried out 
on the calibration observations to investigate the effect of 
a noise-weighted source extraction as opposed to replacing 
the NEED with a flxed arbitrary value. The differences were 
much less than the calibration factor errors for the respective 
wavelengths (~ 1% at 850 /^m and ~ 10% at 450 /xm). 

4.4 Pointing corrections 

Telescope pointing observations are taken at intervals dur- 
ing the evening to correct for the positioning of the tele- 
scope on a source with respect to its pointing model. Whilst 
these offsets are used during data acquisition, drifts between 
pointing observations cannot be corrected for at the tele- 
scope and so must be appl ied retroactively. Once appl ied, the 
SCUBA2MEM program Jjenness fc LightfoodlTooi) . takes 
them into account when exporting the positions of each 
bolometer in the time series for further processing by the 
IDL pipeline. Due to the recent announcement of an error 
in the tracking model at the JCMT which h as affected dat a 
taken between August 2000 and April 2003 jTilanujbOOl) . 
we are making pointing corrections to the maps currently 
reduced. The errors are in azimuth, but are elevation de- 
pendent. 35% of the maps covered in this paper were taken 
during the affected period. However, the median absolute 
error of these when combining the offset between a pointing 
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observation and its science observation is 0.53", with only 
15% of the affected maps having an offset of > 1.5", which 
is of a similar level to the usual rms pointing error of 1.3" in 
each coordinate. The maximum error of any one map is 5 " 
and less than 0.5% of our total maps have a tracking error 
of > 3". 

4.5 Making zero-footprint maps 

The final images are produ ced using an optimal noise- 
weighted drizzling algorithm llFruchter fc HooMl2002^ with 
a pixel size of 1 square arcsec. This is the sa me method 
as th at employed in the SCUBA 8-m Jv Survev l|Scot t et all 
200i) and Hubble Deep Field North JSerieant et al.ll2003al : 
Borvs ct al. : 2003'l data reductions. Both output signal and 
noise maps were created, the signal in any single pixel be- 
ing given by the noise-weighted average of the bolometer 
readouts for which this is the closest pixel. Unlike a stan- 
dard shift-and-add technique, which takes the flux density 
in each detector pixel and places it into the final map over 
an area equivalent to one detector pixel projected on the 
sky, drizzling takes the flux density and places it into a 
smaller area in the final map. Although this significantly 
reduces the signal-to-noise ratio in each pixel, this approach 
helps preserve information on small angular scales, provided 
that there are enough observations to fill in the resulting 
gaps. The area in the coadded map receiving the flux from 
one detector pixel is termed the footprint. Our method is 
an extreme example of drizzling in which the footprint is 
selected to be as small as is practicable given the point- 
ing errors involved (termed the 'zero-footprint'), effectively 
placing delta-functions of flux into boxes one arcsec square. 
This also allows us to represent each bolometer's view of the 
sky as an independent measurement, i.e. there is no pixel-to- 
pixel crosstalk. The spatial variation in the sensitivity of the 
data is taken into account during source extraction rather 
than at this map-making stage. For the SHADES data, sep- 
arate zero-footprint maps are made for each chop throw and 
position angle. 

A possible problem with this methodology is that it is 
lossy compression: by coadding the data, individual pixels 
are combined together and information about the distribu- 
tion of flux readings is lost. However, it is possible during 
the running of the reduction pipeline to calculate statistics 
that help evaluate the self-consistency of the data contribut- 
ing to each pixel and therefore the reliability of the sources 
that are extracted from the map. 



5 THE IMAGE PROCESSING PROCEDURE 

5.1 Matched filter source-extraction techniques 

Convolution of the image with a point spread function (PSF) 
is the usual method for source extraction and is the optimal 
point source fllter in the case o f uniform noise dvon der Heidd 
Il979l: lEales et alJll999l . l2000ll . In the case of non-uniform 
noise, a method of minimisation of the data with 

the PSF is used, whic h can be expressed as a convolution 
(for further details see lSerieant et a'l]|2003al) . This method 
is optimal for point source sensitivity, but is not optimal for 
spatial resolution and assumes that the sources will not be 



resolved or confused. These assumptions can hold for the 
450 nm data, but are more difficult for the 850 ^m data in 
which the beam size is larger and confusion noise is more of 
a problem. 

Indeed, both by examining the data and through clus- 
tering arguments it is clear that some sources are partially 
blended and some are confused. For the science goals of 
SHADES, it is important that sources fairly close together 
on the sky can be properly separated, because otherwise 
much of the potential power of the first radial bin in the 
angular correlation function, w{9), will be lost. 

Starting from the simple case of unresolved point 
sources: gener a lising t he source extraction methodology of 
ISerieant et al.l l|20033)i the best fit (minimum x^) point 
source flux at any point on the sky is given by 

where F is the best flt flux, c indicates the chop throw and 
position angle combination, S is the image signal, W is the 
reciprocal of image variance, P is the point spread function, 
and (g) indicates convolution. The S/N image is derived using 

where 6F is the error on F. In our case, P includes the 
positive beam and both negative sidelobes. 

Simulations shown in Figs.|3 and El indicate that this 
multiple chop source extraction yields similar S/N at the 
peaks as was inserted into the map at the source positions, 
while removing the problem of negative-chop holes coincid- 
ing with other sources. 

Sources are currently identifled as peaks in the S/N 
maps, using a connected pixel approach using the IDL 
LABEL_REGION routine (http://www.rsinc.coml to find the 
peak. Sources were identified using multiple S/N threshold 
cuts between 3. Oct and the peak S/N of the map. These mul- 
tiple cuts allow separation of blended sources, where there 
are separate peaks, (see Fig.|3|l. 

It can be shown that the method of multiple thresh- 
old cuts can deblend Gaussian sources with equal fixed full 
width half maximum (FWHM) having minimum separation 
between peaks of ^-2GHn(p) where FWHM = 2V2/n2a- 
and p = ratio of the individual sources' peak S/N where 
< p < 1. Sources are found with peaks > 3. Oct and we 
have a maximum S/N of ~8. We would therefore consider 
peaks with separation less than ~8.33". This is compara- 
ble to the beam size of the SCUBA beam and therefore this 
should only become a significant issue if sources are found 
with much higher S/N. For the purposes of our completeness 
and reliability simulations, and for our future source count 
derivations from this analysis, we only use the method of 
multiple cuts described above. 

Since source counts are not the only science goal, it 
may be useful to attempt deblending of source pairs which 
are closer than the beam width. To that end a more so- 
phisticated source deblendi ng algor ithm has been developed, 
building on that detailed in lScott'ct al. ( 2 00^ . In the Scott 
et al. technique the flux densities of all significant peaks are 
fitted simultaneously using a maximum-likelihood technique 
at the positions of the peaks. We have extended this to fit 
to the spatial position as well as the fiux, using the position 
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Table 2. The best-fit parameters for fitting two sources 
to the area around Lockman 850.1 and 850.8. The posi- 
tion and flux errors are formal 1 a errors computed from 
the covariance matrix. 

Position Flux (mjy) S/N 

10 52 01.31 +57 24 44.4 5.2±1.1 4.9 
10 52 00.33 +57 24 19.0 9.3+1.2 8.0 




1 ^ i 

O f ^ 

20 40 60 80 
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Figure 4. A connected pixels approach on point-source-filtered 
data with multiple threshold cuts can identify blended sources 
which have two separate peaks. The A.bcr cut would identify only 
the left hand peak, whereas the 3a cut would only identify the 
right hand one. The 3.5(7 cut would identify two peaks. 

and fluxes of sources from the source-extraction method as a 
starting point and fltting between one and six sources at each 
position. This becomes a computationally tractable problem 
when we consider only the pixels around the > 3cr peaks in 
the map that would contribute to the flux at that position. 

Adding additional sources to the fits can improve the 
reduced x^y but these new sources may not necessarily repre- 
sent the underlying distribution. Distinguishing between two 
close point sources and some other extended structure (such 
as a single extended source) implicitly involves constraining 
the quadropole moment of the image, and as [Lucv L1992,) 
has shown, the signal-to-noise requirements for constraining 
the higher order multipoles are extremely stringent. Never- 
theless, the positions of known sources may be refined using 
our multi-wavelength data. Such refinement is optimal if one 
simultaneously fits all the relevant multi-wavelength data, 
and this can be done within our methodology. 

As an example of this methodology, we have attempted 
to deconvolve Lockman 850.1 and 850.8 using the 850 ^m 
maps from the 8mJy survey. The reported fluxes of these 
sources in the 8mJy survey are 10.5 ± 1.6 mJy and 5.1 ± 
1.3 mJy respectively . We performed the multiple source flt- 
ting, and the best fit is given in table |21 the covariance ma- 
trix of which shows a weak correlation of fiuxes with posi- 
tions and between the fiuxes of the two sources. 

5.2 Emerson-II deconvolution 

An alternative method to construct images is to use the 
multiple chop strategy to recover the modes missing in any 
single chop, through the Emerson II deconvolution algorithm 



jEmersonlll995l: Ijenness et ai]l200(]ll . This has been shown 
to be effective in reproducing sources in data with two 
chop throws and one position angle, as was the observing 
strat e gy applied to the Hu bble Deep Field jHueh es et alJ 
Il998l: ISerieant et aI.lf2003a^ . This is not the only possible 
methodology, but it has the advanta ge of having clear prece- 
dents in Galactic astr onomy (e.g. Ijohnstone et al.l liood : 
IPierce-Price et aljlioooll . 

The difference image produced by the chop process dur- 
ing observation can be considered a convolution of the sky 
with the chop function. Recognising that a convolution in 
real space is the same as multiplication in Fourier space, 
the Emerson-II algorithm reconstructs the image by effec- 
tively dividing by the chop function in Fourier space. Using 
this technique, modes in Fourier space are lost where the 
Fourier transform of the chop function is zero, but these 
can be filled in using modes from data taken at other chop 
angles. This method has the benefit of using the flux from 
the negative chop-holes and folding it back onto the source 
position. The down-side of this for the Hubble Deep Field 
data is that modes are lost altogether because the 30" and 
45" are in a 2:3 ratio. In contrast, the chop strategy imple- 
mented for SHADES is well suited to this reduction method 
because the chop throws are incommensurate and the par- 
ticular values of 30", 44" and 68" have been shown pre- 
viously to work well with the Emerson-II technique. Fig.|K| 
shows an example of the source-extraction method of the 
primary pipeline, in which a number of sources, simulated 
using the 8-mJy survey source positions and fluxes, have 
been extracted using the multiple-chop source extraction 
described above. Fig.|3 also shows the alternative method 
of reconstructing the image using the Emerson-II algorithm 
(t he analogous algorithm for SCUBA scan maps is discussed 
in Ijohnstone et al]|2000tl . These two algorithms give superfl- 
cially extremely similar reconstructions; this is partly be- 
cause the multiple chop throws and position angles dis- 
tribute the negative sidelobe fluxes over a large number of 
positions. Although the Emerson-II method may be useful 
for producing cosmetically clean images, we have not in- 
vestigated the presence of possible artefacts made by the 
map- making process, and we prefer to use the direct method 
outlined in Section f5.1l to flnd sources. 

Another approach is to use an iterative reconstruction 
scheme, motivated by Cosmic Mic rowave Background map- 
making methods (e.g. IWright et aL ^ 1^96) and successfull y 
applied to SCUBA scan-maps bv Ijohnstone et al.l i200dl . 
The triple-beam pattern used in jiggle-map mode is more 
problematic for this approach, since each datum is the dif- 
ference between one map value and the average of two oth- 
ers. Investigation of the iterative recons truction of SH ADES 
maps has been of only limited success l|Lepagell200i) . 

5.3 Comparison of these two methods 

To assess the relative merits of these two methods we com- 
pare the completeness and reliability of sources found using 
simulations (see Fig.EJ. The simulations have two compo- 
nents. First, a single Gaussian chop proflle source is added 
at a random position in the real zero-footprint chop maps. 
We then attempt to recover the source (within 7" of the 
input position and within a factor of two of flux and S/N 
>3.5) and information about the position and flux of the 
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Figure 5. A simulation of source-extraction using the multi-ciiop technique. Top left: The input sources as they would appear with 
no noise or chopping. Top right: A source extracted map using only one chop throw/PA image. This is the method that was used for 
the 8-mJy survey, such that the source extraction routine causes a double chop function to be seen. This causes blending of some of 
the sources with the chop functions of other sources. Bottom left: A source extracted map using six chop throw and PA combinations, 
each with the individual map having \/6 times the noise used in one chop image. Bottom right: The same simulated map was used in an 
Emcrson-II dcconvolution method to produce a map without significant chop residues. 



output source is retained. This gives information about the 
completeness and flux boosting. In the second method, a 
map is entirely simulated using the source counts of Scott 
et al. (2005) , the level of clustering seen in the 8-mJy survey 
(Scott et al. 2002), and Gaussian random noise is added us- 
ing the real noise map as input. Sources are then extracted 
as described above and the results used for calculating the 
reliability of the extracted sources, which we here define as 
the number of 'real' sources found as a fraction of the total 
number of sources found by the source identification process. 

The reliability of the Emerson-II method appears higher 
for a given S/N (crosses in Fig.jSJ than the matched-filter 
method (filled triangles), but this is because the S/N for an 
individual source is lower in the Emerson-II method. Af- 
ter correcting for this S/N difference (open circles) it is 
clear that the matched filter has a higher reliability than 
Emerson-II for a given source (i.e., rather than a given S/N). 



6 RESULTS AND ANALYSIS 
6.1 Progress 

A total of 1843 individual jiggle-maps, observed over 139 
nights up to February 2004, have been coadded into zero- 



footprint maps covering 720 arcmin^ . The median noise level 
in the 850 /xm data is 2.2 mjy and that of the 450 /im data 
is 25 mJy. The data were taken within a range of weather 
conditions (JCMT grades 2-3) such that the mean tsso ~ 
0.27±0.07 and T450 = 1.45 ±0.44. 92% of the data have rsso 
in the range 0.20-0.48 and T450 in the range 0.94-2.78. 

The areas covered by the survey to February 2004 are 
shown in Fig.Q Note the uniformity of the noise levels 
within the central regions of the maps as designed, with 
the higher noise regions visible around the perimeter where 
further observations are due to be taken. Although the 
SHADES data are taken only within the fixed weather con- 
ditions outlined above, the data taken at the start for the 
SXDF were taken with consistently poorer than average 
weather (grade 3), hence the higher noise level at the centre 
of the SXDF map. However, the depth is consistent across 
that area and varies only by ~ 0.7 mJy from the rest of the 
map. Also note the deep strip in the Lockman Hole East, 
taken during the SCUBA 8-mJy survey. 

The source-extraction method can also yield sources 
near the edge of the map which we choose to reject due 
to insufficient coverage/sampling and/or high noise values. 
These can be identified and removed from the source lists 
by creating a mask of the integration time convolved with 
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Figure 6. Le/t; Comparison of the output S/N of the input sources for the matched filter and Emerson-II deconvolution methods, for 
the simulations shown in Fig. |S| The dashed lines show S/N = 3 for each of the methods, and the line for which the two methods would 
have equal S/N. Right: The reliability of sources extracted from the matched- filter method (filled triangles) and Emerson-II deconvolved 
maps (crosses). This is based only on the sources in the top-left panel, which is far fewer than the more extensive simulations in Fig. ISl 
however, since it is the same sources being extracted in both matched-filter and Emerson-II methods, the relative reliabilities can still be 
usefully compared. For a fixed S/N the reliability appears to be higher for the raw Emerson-II deconvolution, but this is easily shown to 
be an artefact of the poorer S/N of Emcrson-II for a fixed input flux (left). After adjusting for the poorer S/N in the Emcrson-II method 
(open circles), the reliability is shown to be less for the Emerson-II deconvolved maps than the matched-filter method. Thus, for example, 
a 3.5 (T source in an Emerson-II map is intrinsically brighter than a 5 cr source in a matched filter map, and it is unremarkable that such 
brighter sources can be extracted more reliably; however, an 8mjy source extracted from an Emerson-II map has a lower reliability than 
an 8 mjy source from a matched-filter extraction. 
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Table 3. Comparison values of reliability {R), completeness (C) 
flux boosting factor (B) and positional errors (D) at different 
threshold cuts for our two fields. 

the beam (which in our case rejects those regions near the 
edge of the map with httle coverage) and rejection of those 
sources with large flux-density errors (typically greater than 
10 mJy at 850 /im and greater than 100 mJy at 450 fim). 

Using these criteria, simulations were carried out as de- 
scribed above in section 15.31 for the full SHADES interim 
maps. These gave completeness, reliability and flux boost- 
ing effects as shown in Fig. |H] The expected completeness 
and reliability at 3.0, 3.5, and 4 a are shown in Table |3 

Table |1| gives the number of sources at different signifi- 
cance cuts for each of the SHADES fields using the source- 
extraction method outlined above. In order to estimate the 
source-detection density, we have conservatively considered 



only those sources that are also found in two independent 
reductions, our so-called consensus list (which will be dis- 
cussed in future papers). This in effect means that, for 
sources of modest significance in the range 3-3.5 a, we only 
consider those that are reproduced in all four reductions un- 
dertaken so far. 

In the Lockman Hole East map, 69 sources have been 
identified with S/N > 3 of which 47 have S/N > 3.5. In 
the SXDF, 61 sources have been identified with S/N > 3, of 
which 53 have S/N > 3.5. Thus, with 40% of the data taken, 
SHADES has produced a sample of 100 sources at 850 ^m 
with S/N > 3.5. 

The implied surface density of sources with S/N > 3ct 
is therefore 653 ± 57 sources deg"^ The error on the esti- 
mated source density is calculated using the Poisson error 
on the data (i.e. the number of sources observed) , corrected 
for the area considered. The source density is likely to be an 
underestimate since parts of the maps have only been cov- 
ered by a single chop at this time and consequently have a 
higher noise level. However, the uncorrected source density 
is consistent with a surface density of point sources more 
than sufficient for the science goals described in this paper. 

Fig. |5] shows the S/N histogram of the new maps. A 
Gaussian can be fitted to the S/N data, though at high S/N 
levels an excess above this fit shows there are real sources in 
the map. This is seen to be the case with the SXDF obser- 
vations; in the Lockman Hole, the effect of the combination 
of the deep strip (at one chop throw only) and the noisier 
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Figure 7. Noise maps of the {Top Row) Lockman Hole East and {Bottom Row) SXDF for data taken up to February 2004. For the 
850 /jm data {Left Column), the lowest contour level is 0.71 mjy with higher contours spaced by 0.71 mjy to a maximum level shown in 
the map of 5mjy. For the 450 /^m data {Right Column), the lowest contour level is 7.1 mjy with higher contours spaced by 7.1 mjy to 
a maximum level in the map of 50 mjy. Note the high levels of noise around the edges of the map, which will be reduced when more 
observations are taken. 



edges is evident from the higher excess of pixels at higher 
S/N compared to the SXDF. The Lockman Hole deep strip 
has one chop throw only, so we also see an excess at large 
negative S/N due to the side-lobes of the sources in this 
deep strip. This negative excess is also present in the SXDF 
maps, but at a much lower level because our chop strategy 
deliberately reduces the chop holes (see Fig.j^l. If there were 
no sources in the map, the plot should follow the Gaussian 
shape. Instead, it is possible to see the statistical detections 
at higher S/N as an excess of pixels with that S/N, espe- 
cially at 850 /im. Also, assuming Gaussian random noise, we 
would expect to find 0.62% of the survey beams to contain 
spurious detections at the 2.5 a level. This would mean that 
if there were no real sources in the map, we would expect to 
see a total of about 60 spurious sources in the Lockman Hole 
maps so far, and 44 spurious sources in the SXDF assuming 
a beam size of 14" at 850 fj,m. Instead we detect approx- 
imately 270 source candidates at > 2.5a in the Lockman 
Hole map, and 250 source candidates in the SXDF map. We 
have extended this analysis to other a cuts in Table 21 

6.2 Comparison with the 8-mJy survey 

A re-analysis of the 8-mJy Survey Lockman Hole East data 
was carried out in order to test the new pipeline. Of the 21 



Table 4. Numbers of sources at 850 ^m greater than the S/N 
cut used in the source extraction procedure. Note the steep neg- 
ative slope. Numbers in brackets are the estimated number of 
statistically spurious sources assuming Gaussian noise. 



S/N 


N{> S/N) Lockman 


N{> S/N) SXDF 


1.0 


1773 (1530) 


1372 (1128) 


2.0 


648 (220) 


524 (162) 


3.0 


98 (13) 


106 (10) 


5 


46 (2.25) 


40 (1.66) 


4.0 


24 (0.3) 


16 (0.2) 


5.0 


6 (0) 


4(0) 


6.0 


2 (0) 


2 (0) 


5 


1 (0) 


1 (0) 


Median la noise 
at 850 fim 


2.28 mJy 


2.14mJy 



published sources at greater than 3.5 a" jjSco^^^I |200^, 
4 (LE850.9, 10, 15, 20) were rejected bv llvison et'aO (|2002^ 
because they have crgso > 3 mJy. This leaves 17. Of these, 
12 were found using the new analysis method. Of the 12 
8-mJy survey sources with S/N greater than 3.5 that have 
been reproduced, two lack a radio ID (LE850.4, LE850.11). 
The possible implications of this were outlined in Section 
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Figure 8. Completeness (top), flux boosting (middle) and reliability (bottom) for the Lockman Hole East (left) and SXDF (right). 



12.11 For those sources reproduced, Table |S] shows the S/N 
of the source in the new reduction. For those sources not 
reproduced, the S/N at the position of the original 8-mJy 
Survey 'detection' is shown. 

Of the sources previously detected between 3-3.5 a, only 
two have been confirmed as peaks in the S/N distribution, 
and one source has positive flux at the position with S/N 
of greater than 3. Extra chop data in the area have con- 
firmed one source at a significance level greater than 3.5 cr 
(LE850.29). 

Only one of the nine missing sources originally found 



at a significance level greater than 3.5 ct have a robust ra- 
dio identification. This is source LE850.6, which has been 
resolved into two sources, but neither new peak lies within 
7" of the original source p osition. Note that further analysis 
of the llvison et alj ||2002|) rad io identificat i ons in the Lock- 
man Hole was carried out bv lGreve et al.l (|2004) and their 
results are used here. Source LE850.13 has been lost using 
this data-reduction method and does not return when all six 
chop data are added to the region. 



One possibility is that the missing IScott et al.l (12002 

sources were spurious, perhaps due to weak bolometer noise 
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Figure 9. S/N histograms of the point-source-filtered SHADES maps, with a Gaussian fit to the data plotted as a dashed line. The 
noise levels are derived from the bolometer readouts using the methodology described in Section 14.21 which are used in the map making 
(Section 14.51 and source extraction fScction 15.11 procedures. Top:Left Lockman Hole 850 /xm map - mean -0.0147, variance 1.067. 
Top:Right Lockman Hole 450 /im map - mean -0.00682, variance 1.020. Bottom:Left SXDF 850 /im map - mean -0.0374, variance 1.152. 
Bottom:Left SXDF 450 /^m map - mean -0.002569, variance 1.111. Note the very close fit to the data at 450 /^m and the excess S/N at 
850 Atm. 



spikes that escaped clipping in the earlier analysis. However, 
this cannot be the whole explanation since there is a weak 
positive si gnal in the posit ions of the missing sources (as 
quoted bv lScott et al.ll20o3 . see Tabled in our refined re- 
analysis. 

One other possibility that could lead to apparently spu- 
rious sources, in the sense that the sources seem to be non- 
repeatable, is Eddington bias jEddingtonlllQiaTl. This bias, 
sometimes confused with Malmquist bias jTeerikorpilll997ll . 
describes a feature of observing a population of objects 
above a given flux limit with a negative-sloping source count. 
Random errors in the flux measurements of the objects can 
systematically alter the source counts measured above the 
flux limit, such that more sources have their flux densities 
boosted above the flux limit, than those that fall below the 
limit. This makes there appear to be more sources close to 
the flux limit than in the true population, the efl'ect being 
more pronounced for lower S/N thresholds. A simulation of 
this effect on 10,000 sources can be seen in Fig. 8. When ob- 
servations are repeated, different sources are boosted above 
the flux limit while others again drop below the flux limit. 
The number of sources observed in each of the first and sec- 
ond observations with a S/N in the range 3-3. 5cr is ~2600 
but the number of sources with S/N in the range 3-3. 5cr in 



the first observation and > 3a in second is only 535. In this 
way it is possible to have up to 80% of the 3-3. 5(t sources not 
appearing in both samples. This 80% is made up of those 
sources that are fiux boosted in the first observation and 
not in the second, or vice versa, and it will happen even if 
all the 3-3. 5a" lists are 100% reliable. By re-analysing the 
noise in the 8-mJy survey maps, we have re-weighted the 
observations, and effectively resampled the noise, although 
the effect will probably be smaller than that shown in Fig. 
8 because the raw data are the same and it is the dominant 
sky noise that has been re-measured. This might explain why 
the lower significance 3-3. 5a source lists from the 8-mJy sur- 
vey data reduction and the SHADES data reduction contain 
only 3 sources in common. 

Fifteen new > 3.5a sources have been identified using 
the new source-extraction method in the 8-mJy survey area, 
of which only seven are not also in the consensus lists. Three 
of these fifteen have noise greater than 3mJy. One of the 
new sou rces in the consensus list was first reported in full 
detail in lSerieant et al.l i2004ri with two possible Spitzer de- 
tections which agree strongly with the radio identifications 
presented there. Although not detected at 450 and 1200 /xm, 
weak positive signals of 1.6a and 1.8a have been found at 
this source position in both maps. The four sources not in 
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100.0 



Flux {Ia=l flux unit) 

Figure 10. A simulation of Eddington bias for a sample of 10,000 
sources with one cr equal to one flux unit. Although flux boost- 
ing does occur for a fraction of the simulated sources, the ac- 
tual sources detected between 3-3. 5fT varies between individual 
observations because of the random effect of the noise. Of the 
10,000 sources simulated here, ~2600 were found with S/N be- 
tween 3— 3.5(7 in the first and/or second observations, but only 
535 were found with S/N in the range 3-3. 5fT in the first ob- 
servation and > 3<T in the second. Therefore, when applying a 
significance threshold, the lower-significance source lists derived 
from each observation are markedly different, despite the fact 
that both observations yield clearly consistent Eddington-biased 
source counts. 



the consensus list and with noise less than 3mJy are listed 
in Table IHl 

A future paper will perform a detailed comparison of the 
pipelines over all the SHADES area, i.e. not just restricted 
to the 8-mJy Lockman Hole field. 



7 CONCLUSIONS AND SURVEY PROGRESS 

To February 2004 the SCUBA Half Degree Extragalactic 
Survey areas covered a region of 720 arcmin'^ (approxi- 
mately 40% of the total expected area) using 1843 individual 
jiggle-maps that have been observed over 139 nights during 
the period March 1998 - February 2004. 

Source extraction from these maps indicate at least 
653 ± 57 sources deg-2 having S/N > 3 at 850 /im in these 
survey fields (uncorrected for Eddington bias), consistent 
with a surface density of point sources more than sufficient 
for the science goals described in this paper. 

This paper has outlined the SHADES survey goals, 
data-taking and data reduction strategies, as part of which 
a new SCUBA reduction pipeline has been developed using 
IDL, based upon the reduction pipeline used for the 8-mJy 
survey. A test of this data-reduction and source-extraction 
method was made by comparing the new maps with the 
sources extracted from the original 8-mJy survey Lockman 
Hole East maps that form a sub-set of the SHADES maps. 
Of the 17 more secure (> 3.5(t) sources in the 8-mJy sur- 
vey, 13 have been reconfirmed of which 11 have radio iden- 
tifications. 13 new candidate sources with S/N > 3.5 and 
(""850 < 3.5 mjy) have been identified using the new data- 
reduction method, nine of which appear in the consensus 



Table 5. The S/N of the sources using the new data reduction 
software, applied to the 8-mJy survey data only. The S/N quoted 
is of the source that corresponds to the old source (Less than 7 " 
from the original 8 -mJy survey source) . Radio identifications arc 
from lCreve et aTl i2004l) and selected such that the probability 
that an apparent radio source identification could be the result of 
chance is p < 0.05. Of the nine sources in the original S/N > 3.5 
list that have now been lost, 4 have already been rejected by Ivison 
et al. (2002) as likely reduction artifacts or seriously fiux-boosted 
sources extracted from the high-noise regions of the original 8- 
mjy maps (LE850.9, 10, 15 and 20, marked by an asterisk). Source 
LE850.6 has been resolved into two blended sources, but does 
not strictly have an peak within 7 " of the original source. The 
new sources found from our reduction are named with the prefix 
LE850.M, and do not appear in the current consensus lists. Not 
listed here are the eight sources found by our algorithm that do 
appear in the consensus reduction. 



8-mJy source 


8-mJy S/N 


New S/N 


Radio ID? 


LE850.1 


8.1 


8.0 


Yes 


LE850.2 


5.2 


5.1 


Yes 


LE850.3 


5.1 


3.9 


Yes 


LE850.4 


5.0 


5.1 


No 


LE850.5 


4.6 


2.0 


No 


LE850.6 


4.5 


2.8 


Yes 


LE850.7 


4.5 


4.0 


Yes 


LE850.8 


4.4 


3.8 


Yes 


LE850.9* 


4.2 


0.8 


No 


LE850.10* 


4.2 


1.5 


No 


LE850.1lt 


4.1 


4.3 


No 


LE850.12 


4.0 


4.3 


Yes 


LE850.13 


3.7 


1.4 


No 


LE850.14 


3.6 


5.2 


Yes 


LE850.15* 


3.6 


1.8 


No 


LE850.16 


3.6 


4.3 


Yes 


LE850.17 


3.5 


1.3 


No 


LE850.18 


3.5 


4.2 


Yes 


LE850.19 


3.5 


1.7 


No 


LE850.20* 


3.5 


0.2 


No 


LE850.21 


3.5 


4.0 


Yes 


LE850.27 


3.4 


3.3 


N/A 


LE850.29 


3.3 


4.7 


N/A 


LE850.32 


3.2 


3.2 


N/A 


LE850.M1 




3.88 


Yes 


LE850.M2 




3.77 


? 


LE850.M3 




3.73 


N/A 


LE850.M4 




3.51 


N/A 



t Source rejected in^isor^^^^ ^00^ for 

its high noise value but considered robust here 



between other reductions which will be discussed in future 
papers. 

A full presentation and analysis of the sub-mm source 
counts derived from the interim SHADES maps discussed 
here will be presented in a separate paper. This paper will 
also compare the results of the data reduction and source 
extraction described here, with the outcome of three fur- 
ther independent reductions/extractions carried out at If A 
Edinburgh, INAOE Mexico, and UBC, Vancouver. 

The SHADES consortium maintains a public web page 
at |http77/ www. roe . ac.uk/ if a/shades/J 
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